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The 

RICIS 

Concept 


The University of Houston-Clear Lake established the Research Institute for 
Computing and Information systems in 1986 to encourage NASA Johnson Space 
Center and local industry to actively support research in the computing and 
information sciences. As part of this endeavor, UH-Clear Lake proposed a 
partnership with JSC to jointly define and manage an integrated program of research 
in advanced data processing technology needed for JSC’s main missions, including 
administrative, engineering and science responsibilities. JSC agreed and entered into 
a three-year cooperative agreement with UH-Clear Lake beginning in May, 1 986, to 
jointly plan and execute such research through RICIS. Additionally, under 
Cooperative Agreement NCC 9-16, computing and educational facilities are shared 
by the two institutions to conduct the research. 

The mission of RICIS is to conduct, coordinate and disseminate research on 
computing and information systems among researchers, sponsors and users from 
UH-Clear Lake, NASA/JSC, and other research organizations. Within UH-Clear 
Lake, the mission is being implemented through interdisciplinary involvement of 
faculty and students from each of the four schools: Business, Education, Human 
Sciences and Humanities, and Natural and Applied Sciences. 

Other research organizations are involved via the “gateway” concept. UH-Clear 
Lake establishes relationships with other universities and research organizations, 
having common research interests, to provide additional sources of expertise to 
conduct needed research, 

A major role of RICIS is to find the best match of sponsors, researchers and 
research objectives to advance knowledge in the computing and information 
sciences. Working jointly with NASA/JSC, RICIS advises on research needs, 
recommends principals for conducting the research, provides technical and 
administrative support to coordinate the research, and integrates technical results 
into the cooperative goals of UH-Clear Lake and NASA/JSC. 
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Executive Summary 


Sensory inputs from the visual, vestibular and proprioreceptive systems are integrated by the central 
nervous system to maintain postural equilibrium. Sustained exposure to microgravity causes 
neurosensory adaptation during spaceflight, which results in decreased postural stability until 
readaptation occurs upon return to the terrestrial environment. Data which simulate sensory inputs under 
various conditions have been collected in conjunction with Johnson Space Center postural control 
studies using a tilt-translation device. The University of West Florida proposed applying the fuzzy c-means 
clustering algorithms to this data with a view towards identifying various states and stages. 

Data supplied by NASA/JSC via Tom Collins, Krug Life Sciences, were submitted to the Fuzzy c-Means 
(FCM) algorithms In an attempt to identify and characterize cluster substructure in a mixed ensemble of 
pre and post adaptational TTD data. Following several unsuccessful trials with FCM using a full 11 
dimensional data set, we discovered a set of two channels (features) enables FCM to separate pre from 
post. Our main conclusions are that FCM seems able to separate pre from post TTD subject #2 on the one 
trial we have used so far, but only in certain subintervals of time; and that Channels 2 (right rear transducer 
force) and 8 (hip sway bar) contain better discrimination information than other supersets and 
combinations of the data we have tried so far. 
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2. Project Description 


Subsequently, we have been studying separability of pre and post across the time axis, by subdividing 
the data into "time slices" or subintervals of the 20 second data collection epoch. So far, our tests have 
involved only one subject and one TTD test. For this data, several measures of separability, as well as the 
error rate computed by resubstitution of the training data, suggest the FCM can separate certain time 
slices much more readily than others. We will continue to widen our computational experience, by 
enlarging the tests we have conducted across different trials on the same subject, and then different 
subjects. 

Fuzzy c-Means 

Let (c) be an integer, 1< c<n and let X = {x 1 , x 2 x n } denote a set of (n) feature vectors in Jl p . X is 

numerical object data-, the j-th object in this study is a set of p measurements of sensor signals at time t. To 
be technically accurate, the notation for the posture control data should be something like Xj = x(tj), j = 1 , 2 , 

... , n; however, in the interests of clarity we will suppress the dependency of the feature vectors on time. 
xj k is, for this data, the j-th channel value associated with time k. Given X, we say that (c) fuzzy subsets 
{Uj:X** [0,1]} are a fuzzy c-partition of X in case the (cn) values (u jk = Uj(x k ), 1<k<n, 1<i<c} satisfy three 

conditions: 


0<Uj k < 1 for all i.k ; 


(la) 

£u jk = 1 for all k ; 

and 

(1b) 

0 < £ u jk < n for all i. 


(1c) 


Each set of (cn) values satisfying conditions (1) can be arrayed as a (cxn) matrix U = [u jk ]. The set of all such 
matrices are the non-degenerate fuzzy c-partitions of X: 

Mj cn = {U in Jl cn | u jk satisfies conditions (1) for all i and k}. (2) 

And in case all the u ik 's are either 0 or 1 , we have the subset of hard (or crisp) c-partitions of X: 

Men = {U in M {cn | u ik = 0 or 1 for all i and k}. (3) 
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Data structures identified by partitions which are optimal in the sense of minimizing the function defining 
them often provide good insights and explanations into substructure of the process that produced the 
data. The FCM functional is as follows: 


j ro (U.v ; x) - ra ik m (||x k .v i || A ) 2 


me [1 , oo) is a weighting exponent on each fuzzy membership; (6a) 

U e M {cn is a fuzzy c-partition of X; (6b) 

v = (v-j , v 2 , .... Vg) are cluster centers in 51 s ; ^ 

A = is any positive definite (s x s) matrix; and (6d) 

||x k -Vj|| A - (x k -Vj)"^A (x k -Vj) is the OG distance (in the A norm) from to Vj (6e) 


Conditions necessary for a local minimum of J m are as follows: 

Fuzzv c-Means (FCM) Theorem T41. (U.v) may minimize I£ Uj k m (||x k - Vj|| A ) 2 for m>1 only if : 

u jk = ( Z ||xk-vi(| A /IIXK-Vjll^-^-l) for all i,k ; and (7a) 

Vj = 2:(Uj k ) m x k /2^Uj k ) m for alii (7b) 


The FCM algorithms are simple Picard iteration through (8) : 

Fuzzv/Hard c-Means fFCM/HCMt Algorithms f21. 

<FCM/HCM 1> : Given unlabeled data set X = {x 1 , x 2 x n } . Fix : 1 < c < n; 1 < m < ~ (/ m=1 for HCM); 

positive definite weight matrix A to induce an inner product norm on H s ; and c, a small positive constant. 

<FCM/HCM 2>: Guess v Q - ( v 1 Q , v 2 Q v Q Q ) e R cs (or, initialize U Q e M fcn ). 

<FCM/HCM 3>: For j = 1 to J: 

<3a> : Calculate U= with {v- . 1 } and (8a) ; 

J *»r * 

<3b>: Update Vj ■ ^ to Vj j with Uj and (8b), 1< i < c 

<3o: lLmax{ ||Vj = 1 - Vj j || }< c, then stop and put (U*,v*) = (Uj.Vj); £Ise : Next j 
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Configuration of the Posture Control Data 

The following conceptual arrangement of the data will be referred to in subsequent discussion. We regard 
the data as an array of size (p x 4000), where p= number of features (channels) used in the processing. 
Each column of the matrix is thus a vector in Jl p ; and each row of the matrix contains the observations 
collected by one sensor at each point in time. Further, the data are labeled as pre and post, so the overall 
data matrix is partitioned at column 2000 (the final observation time). EMG data were sampled at four times 
the frequency of transducer data, so we decimated the EMG data in order to align them with the 
transducer samples. 

The basic data set then consists of 2000 samples taken across a 20 second time interval with sensors 
attached to a subject at 1 1 locations (channels). Data were collected both before (pre) and after (post) a 
subject was exposed to roughly 30 minutes in the TTD with one of six trial environments (Trials 1 -6). When 
using FCM, rows of the data matrix X in Figure 1 correspond to features. For p=1 1 , all of the data channels 

are used. Choosing, e.g., features 2,5, and 8 corresponds to reading and processing only those three 
rows of X. The vector x pre 1 which is highlighted in Figure 1 is a column vector with p entries : * pre j = 

(x „ „ x „ „ x * It will be convenient in our discussion to subscript data sets as follows: 

v pre.l ,1* pre.l ,2 pre.l ,p' 

Xjj = data matrix for subject i, trial j l<i<5and 1 <j<6. (8) 


Conceptually, the data matrix has the following configuration: 

Figure 1. Arrangement of the Posture Control Data for one subject for one trial 
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The 1 1 features in X are labeled as shown in Table 1 (Feature # F = NASA Channel # C): 


Table 1. Posture Control Features 


T| 

II 

o 

Location 

1 = 1 

left front transducer force 

2=2 

right front transducer force 

3 = 3 

shear force transducer 

4=4 

left rear force transducer 

5=5 

right front force transducer 

6=7 

shoulder sway bar 

7=8 

hip sway bar 

8 = 11 

soleus 

9 =12 

hamstrings 

10 =13 

tibialis 

11 = 14 

quadriceps 


Feature Selection 


After several runs using all 11 channels, each of which produced uninterpretable results, we performed 
several statistical analyses (principle components and MANOVA) in an attempt to find transformations of 
the data that would give better results in 1 1 space. These attempts were also short lived, and seemed to 
produce nothing useful. Finally, we resorted to a graphical plot of the raw signals in all 1 1 channels, and 
used visual inspection to select the signal channels that seemed most likely to possess good 
discriminatory power. The features (channels) selected for further analysis were as follows: 


Channel 2 = right rear force transducer 
Feature Set 1 Channel 5 = right front force transducer 

Channel 8 = hip sway bar 


None of the EMG data seemed, upon visual inspection at least, to contain information that could be used 
to elicit classification, so we abandoned processing on these channels early in the study. At the 
suggestion of Tom Collins, we also tried the following sets of three features: 


Feature Set 2 


Channels (1+2+4+5)/4 = ave. left, right.front.rear force transducers 
Channel 3 = shear force transducer 
Channel 8 = hip sway bar 


Feature Set 3 


Channels (1+2+4+5)/4= ave. left, right.front.rear force transducers 
Channel 3 = shear force transducer 
Channel 7 = shoulder sway bar 
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However, neither of these feature sets seemed to produce better results than the channel 3-tuple {2,5,8}. 
We will continue to experiment with Feature sets 1 , 2 and 3 as our computational base widens to include 
further trials and subjects. 

Initialization of FCM for the Posture Control Data 


Since X is labeled (pre or post), we can initialize FCM in step FCM 2 with U L , the hard partition that labels 
the dat a Moreover, the number of classes is known, c=2. Thus, partition U L is the 2 x 4000 matrix : 


1 1 1 

1 1 

000 

0 0 


0 0 0 .....0 0 

1 1 1 1 1 


pre 
9* post 


(9) 


This initialization cannot be used, of course, with unlabeled data, so initialization procedures for FCM will 
have to be widened as the study progresses. For calculations on time subintervals, a label matrix in the 
form of (10), adjusted to the correct subsize, is used to initialize FCM, and is the basis for computation of 
the resubstitution error rate described next. 


Measures of Performance and Separability 

We use three performance indices to guide our analysis of the data. The primary measure of performance 
is the observed label error rate E L (U, X^) for U in M cn - This is computed by first converting any terminal 

fuzzy c-means partition, say UpQ^, into a hard partition by thresholding with the so-called method of a- 
cuts. Specifically, for a chosen membership threshold a e [0,1], we define the hard c-partition U a derived 
from UpQ M as follows: 

For cols j for which there is a row i in UFCM such that U FCM.i| 2 ”■ Vii ' ' ’ Vi ‘ * i: 3nd ° lt ' e ™ ,ISe ’ 

For cols j for which there is no row i in UFCM such that u fcm ij - a * declare "no label for j" . 

Because "no label for j" columns of U a do not contain a "1" in any row, U a is not, strictly speaking a hard 
partition of the data. This can be accounted for in a formal way by adding a c+1-st row to U a and U L , with 
zeroes in every column of U^, and (placed) Ts in each column of U a where “no label" occurs. After the 
hard “partition" U a has been determined, we compute the label error rate as follows: 

E L< u FCM- x ii)- II l u L,ij' u a,ii |/2n L (10) 
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where n L is the number of labeled data used for the run. E L is simply the number of times that the labels 
in U a disagree with the given labels divided by the total number of trials (samples) used to generate 

U FCM‘ 

We are also using two measures of separability of the data that are related to E^ t and are thus most 

accurately regarded as'second order"measures of classifier performance. The reason we are studying 
these measures is to find a means for detecting, in unlabeled d ata, when the data are being well 
separated, since the error rate E L cannot be computed with unlabeled data in on-line processing during 

dat acquisition. These measures are as follows: 

Cluster Center Separation (c=2, Euclidean Norm, ) 

DV < V FCM> = 11 V FCM, pre - V FCM, post H (1 1 ) 

It is intuitively plausible, but not mathematically necessary, that DV increase as the clusters that have 
V FCM pre and V FCM post as ,heir prototypes become increasingly well separated. This is illustrated 

schematically in Figure 2: 

Figure 2. Geometric Rationale for the measure of Cluster Center Separation 
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Partition Entropy 


H < U FOM)-‘ EZ “ik ln .(Uik |/n L < 12 > 

This index is simply one measure of "fuzziness" of any partition in M jcn . The value of H is 0 on every hard 
U in M„, and increases to a maximum of ln.(c) « 0.69 ... for c=2 as U approaches the “fuzziest" partition, 
namely, Ujj - (1/c) for all i and j. Thus, the better separated the clusters, the harder is U F q M , and the closer 
H(Ufcm) is zerD ' H is oflen ^ 001 always!) a good indication of (algorithmic) separability of X, so we will 

track its ability to indicate when FCM is doing well. 

Initial Results and Discussion 

Several oins using Feature set #1 = Channels {2,5, 8} and various 2 dimensional subsets thereof were 
made on all 4000 data vectors. These runs were notable only in their lack of success. Following this, we 

began subdividing X into time slices, and processing subinterval data sets. That is, we took a vertical 
subslice through the matrix X in Figure 1 , adjusted U Q and n^, and submitted the data to FCM. This has 

been done over a number of different feature subsets and time slices. The most striking results are 
illustrated in Table 2 and Figure 3, which is a plot of the data in Table 2. 

Table 2. Performance Indices for data X 22 on time slices of 2 seconds each 


T(secs) 

DVInit 

DVfinal 

100*Entropy 

% Error 


DV ( v FCM,iW 

DV(v FCM, final* 

10 °‘ H ( U FCM> 

100^(U FCM .Xij) 

2 

12 

159.36 

12 

50 

4 

24 

42.83 

36 

47.5 

6 

130 

137.88 

13 

6 

8 

70 

115.51 

14 

22.25 

10 

232 

232.63 

3 

0 

12 

23 

84.32 

40 

61.5 

14 

221 

221.21 

5 

0 

16 

158 

185.64 

15 

14.25 

18 

89 

91.21 

17 

1.75 

20 

20 

71.9 

40 

55 
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Figure 3. Performance indices for data X 22 on time slices of 2 seconds (cf. Table 2) 



DVinit 

DVfinal 

100H 

% Error 


Time (secs) 


Note that the distance DV between cluster centers was computed twice; at initialization, using (7b) with 
U L ; and again at termination, using (7b) with Up CM . Figure 3 shows that DV increases and H decreases 

whenever E L , the labeling error rate, is close to zero. The tentative inference to be drawn from this graph 
is that FCM is able to differentiate between pre and post signals at some subintervals of the data collection 
epoch. These results suggest that either DV or H or both may be indicators of the error rate across such 
time steps (remember, there is no way to know what the error rate is with truly unlabeled data). However, 
one must view this conjecture with great caution; it would be foolish to impute much credence to this 
hypothesis on the basis of outputs from one subject on one trial. Our goal for the remainder of the project 
will be to wider our computational experience across both trials on the same subject, as well as across 
different subjects. Perhaps this initial conjecture will not stand up across different data sets; on the other 
hand, we feel that these initial results are at least encouraging. 
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